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MAXIMUM SMOOTHED LIKELIHOOD ESTIMATORS FOR THE INTERVAL 

CENSORING MODEL 

By Piet Groeneboom 

Delft University 

We study the maximum smoothed hkehhood estimator (MSLE) 
for interval censoring, case 2. Characterizations in terms of convex du- 
ality conditions are given and strong consistency is proved. Moreover, 
we show that, under smoothness conditions on the underlying distri- 
butions and using the usual bandwidth choice in density estimation, 
the local convergence rate is n~^'^, in contrast with the rate n~^'^ 
of the ordinary maximum likelihood estimator. In this situation the 
^^ MSLE is asymptotically equivalent to the solution of a non-linear in- 

tegral equation, which we solve using the implicit function theorem in 
Banach spaces. It is shown that, again under appropriate conditions, 
the MSLE has a normal limit distribution of which the expectation 
and variance are explicitly determined as a function of the underly- 
^ ing distributions. This is done by showing that the solution of the 

pH non-linear integral equation is sufficiently close to the solution of a 

r/^ linear integral equation, which, in turn, is sufficiently close to a "toy 

estimator" , depending on the underlying distributions, for which we 
can compute the asymptotic bias and variance explicitly. 
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I— ' 1. Introduction. In [8] the smoothed maximum likelihood estimator (SMLE) and maximum 

pg smoothed hkehhood estimator (MSLE) were studied for the current status model, the simplest 

^ interval censoring model. It is called the interval censoring, case 1, model in [5] and [11]. It was 

shown in [8] that, under certain regularity conditions, the SMLE and the MSLE, evaluated at 
a fixed interior point, converge at rate n~^'^ to the real underlying distribution function, if one 
■^ takes a bandwidth of order n^^'^. This convergence rate is faster than the convergence rate of 

(T^ the non-smoothed maximum likelihood estimator, which is n~^'^ in this situation, as shown in [5] 

^^ and [11]. Moreover, the limit distribution is normal, in contrast with the limit distribution of the 

^__l non-smoothed maximum likelihood estimator. 

IL" The more realistic interval censoring model, where there is an interval in which the relevant 

•^H (unobservable) event takes place, is much more common and also more important, in particular in 

rS medical statistics. It is called the interval censoring, case 2, model in [5] and [11]. A preliminary 

C^ discussion of the SMLE in this situation can be found in [10], where it was shown that the devel- 

opment of the theory of the SMLE for this model crucially depends on a further analysis of the 
integral equations, studied in [2], [3] and [4]. In the present paper we study the MSLE instead and 
prove a consistency and asymptotic normality result for this estimator. We also discuss algorithms 
for computing the MSLE, which is a rather non-trivial issue. 

The MSLE has the advantage over the SMLE that it can be used in situations where the SMLE 
cannot be used. As an example, the MLE itself is proved to be inconsistent for the current status 
continuous mark model (see [13]), and the SMLE will inherit the bad properties of the MLE in this 
situation, and also be inconsistent. On the other hand, a version of the MSLE, based on histograms, 
is proved to be consistent for this model in [9] . A similar phenomenon holds for the two-dimensional 
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2 P. GROENEBOOM 

right-censoring model, where the MLE is inconsistent (see [14]) and the SMLE will not make this 
better. In this case the MSLE will, under appropriate smoothing of the observation distribution, 
also be consistent. 

We recall the interval censoring, case 2, model. Let Xi, . . . ,Xn be a sample of unobservable 
random variables from an unknown distribution function Fq on [0, oo). Suppose that one can observe 
n pairs (Tj, Ui), independent of Xi, where Ui > Ti. Moreover, 

^ji = l{Xi<Ti}> ^j2 = ^{Ti<Xi<Ui} and Ajs = 1 - Aji - Aj2, 

provide the only information one has on the position of the random variables Xi with respect to 
the observation times Tj and Ui. In this set-up one wants to estimate the unknown distribution 
function Fq, generating the "unobservables" Xi. 

Interestingly, from a computational point of view, the MLE for the distribution function of the 
hidden variable in the case that one has more observation times Tj, C/j, T^, . . . "per hidden variable" , 
can always be reduced to the case of interval censoring, case 2. This follows from the fact that at 
most two of the observation times of the set {Tj, Ui,Vi, . . . } are relevant for the location of the 
hidden variable. If we know that the hidden variable is located between two observation times, 
while the other observation times for this hidden variable are either more to the right or more to 
the left, then these other observation times do not give extra information and can be discarded in 
computing the MLE. Likewise, if we know that the hidden variable lies to the right of all these 
observation times, all observation times smaller than the largest one do not give extra information, 
with a similar situation if we know that the hidden observation time lies to the left of the smallest 
observation time for this variable. So in the last two cases only one observation time gives relevant 
information and the other ones can be discarded. This motivates concentrating on the interval 
censoring, case 2, model, as an extension of the current status model. 

The MSLE for the interval censoring, case 2, model is defined in the following way. Let g be 
the joint density of the observation pairs {Ti,Ui), with first marginal gi and second marginal 52- 
Moreover, let the densities /iqi, /io2 and ho be defined by 

(1.1) hoi{t) = Fo{t)gi{t), ho2{u) = {l-Fo{u)}g2{u), ho{t,u) = {Fo{u) - Fo{t)}g{t,u). 

We define hnj, j = 1, 2, and hn as the estimates of the densities /iqj, j = 1, 2, and the 2-dimensional 
density ho, where 

(1.2) Ki{t) = -Y,Kb„{t-Ti)Aii, K2{u) = -Y,Kb^iu-U^)Ai3, 

i=l i=l 

1 " 

(1.3) hn{t,u) = -Y,^K{t- Ti) Kbju - Ui)Ai2, 

j=i 

and 

for a symmetric continuously differentiable kernel K with compact support, like the triweight kernel 

(1.4) K{x) = i (1 - x'^f l[-i,i](x), X G M. 
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At boundary points we use a boundary correction by replacing the kernel K hy a linear combination 
of K{u) and uK{u). For example, if t G [0, hn), we define 

Ki{t) = a{t/bn)-^Kb„{t - Ti) Ail + /5(t/6„) ^ Yl ^-JT^^bAt " Ti) A^, 

where the coefficients a{u) and /3(u) are defined by 

/u pu 

K{x)dx + f3{u) xK{x)dx = l,ue[0,l]. 

and 

/u pu 

xK{x) dx + (3{u) / x'^K{x) dx = 0,u£ [0, 1]. 

It may happen that hni{t) < 0; in that case we put hniit) = 0. 
If t G (Af — 6„, M], we similarly define: 

/i„,i(i) = a{{M - t)/hn)-Y^Kb„{t - Ti) Ail - /3((M - t)/6„)i V ^-^lliRb^t - Ti) An, 

where the functions a and /3 are again defined by (1.5) and (1.6). The estimates /i„2 and hn are 
similarly defined if one or more (in the case of hn) arguments have distance less than bn to the 
boundary; for hn we apply this to the factors of the product of the kernels separately, in the same 
way as for the one-dimensional estimates hnj- 

The MSLE Fn is now defined as the distribution function, maximizing the criterion function 

(1.7) i{F)= jhni{t)\ogF{t)dt+ fhn2iu){l-F{t)}du+ f Kit , u) log{F (u) - F (t)} dt du , 
as a function of the distribution function F. But in practice we discretize, and maximize: 

m m 

J2{hnl{ti)logF{ti)}di + Y{hn2{U)log{l - F{ti)}}di 

8=1 j=l 

m— 1 m 

(1.8) +Y. Yl {hn{ti,tj)log{F{tj) - F{ti)}}didj, 

i=l j=i+l 

where ^ = Iq <ti,- ■ ■ <tm are the points of a grid, and di = ti — tj-i, i = 1, . . . ,m. 

Note that the maximization of (1.7) is the same as the minimalization of the Kullback-Leibler 
distance 

/ ^"^ ^'^ '°^ mw) ^' + / '^"^ ^'^ '°^ {i-F(IL(t) "^^ 

/r / N , hn(t,u) , , 

hnit,u)log "^ ; dtdu 

{F{u) -F{t)}g[t,u) 

Throughout this paper, we will assume that the following conditions is satisfied. 



4 P. GROENEBOOM 

Condition 1.1. (Conditions on underlying distributions) The observation density g is contin- 
uous and bounded on its support S, which is the closure of an open connected set, contained in 
[0,M]^, and has the property 

sup g{t,u) = 0, 

u-t<e,0<t<u<M 

for some e > 0. We call this the strict separation hypothesis. Furthermore, we suppose that 

{{t,u):g{t,u)>0} 

is a set bounded by the two line segments connecting (0,0) and (0,M), and (0, Af ) and {M,M), 
respectively, and a smooth curve 

{(t,7(t):tG[0,M]}, 

where e < 7(0) < 7(M) = M and 7 is an continuous function which is strictly increasing on the 
interval [0,c], where c is defined by 

c = sup{t > : 7(t) < M}. 

The prototype of such a function is: 

^(f) = ! * + ^' *^ [O'Af-e), 
^^^ \ M, te [M-e,M]. 

We will in fact assume that (t,^{t)) runs inside the region bounded by two regions of this simple 
type for an e > and e' > e, where e' < M/2. 

We also assume that the underlying distribution function Fq of the "unobservables" is continuous 
on (0,M) and that Fq and the marginals gi and 32 of g satisfy 

(1.9) inf {gi{t){l - Fo{t)} + g2{t)Fo{t)} > 6, 

te(o,A/) 

for some 5 > 0. Condition (1.9) was also used in [2], [3] and [4]- We denote the supports of gi and 
52 by Si and S2, respectively, and assume that Si and S2 are closed subintervals of [0, M] such that 
G Si and M £ 52- 

Condition 1.2. (Conditions on the kernel estimators) We assume that hnj and hn are kernel 
estimators of h^j and ho, respectively, defined by (1-2) and (1-3), respectively, for a symmetric 
continuously differentiable kernel K of type (1-4), with compact support. Moreover, for points near 
the boundary, boundary kernels are used, with coefficients a{t) and (3{t), defined by (1.5) and (1.6), 
respectively, where the functions a, (3, and its derivatives a' and f3' are bounded on [0, 1]. Let a and 
b be defined by: 

(1.10) a = mflte[0,M]:hniit)y hniu,t) du > o\ , 
and 

(1.11) b = suplte[0,M]:hn2{t)y hn{t,u) du > o\ , 
We further assume that < a < b < M and that 

(1.12) hni{t)y j hn{u,t)du>0,te {a,M], 

Ju=0 
I'M 

(1.13) hn2(t)y I hnit,u)du>0,t£[0,b). 

Ju=t 
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An example of a kernel estimates, satisfying the conditions of Condition 1.2, is given by kernel 
estimates which use the triweight kernel, defined by (1.4). For this kernel, the weight functions a 
and /3, used in constructing the boundary kernel, are decreasing on [0,1], with a(0) ~ 6.11457, 
a(l) = 1, and /3(0) ~ 15.0476, /3(1) = 0. The corresponding derivatives a' and /3' are increasing on 
[0, 1], with a'(0) K, -40.8931, a'(l) = 0, and /3'(0) « -100.635, ^'(1) = 0. 

Using Condition 1.1, we give a characterization in terms of necessary and sufficient (duality) 
conditions for the MSLE in section 2. In that section we also prove consistency of the MSLE, using 
techniques, similar to the method, used in [11], Part II, section 4. 

In section 3 we discuss algorithms for computing the MSLE: the EM algorithm, an interior point 
algorithm and an iterative convex minorant algorithm. The iterative convex minorant algorithm is 
an adapted version of the algorithm, introduced in [5] and (again in) [11]. It turns out that the 
latter algorithm performs best in our experiments. The EM algorithm is very slow and therefore 
not suitable for larger sample sizes or simulation purposes. 

In section 4 we will prove asymptotic normality of the MSLE at a fixed interior point of the 
domain of definition (Theorem 4.1). To give an example of a situation where the conditions of 
Theorem 4.1 are satisfied, we take g uniform on the triangle with vertices (0, e), (0, 1) and (1 — e, 1). 
This example is also discussed in [10]. Then 

/I i.\ / ^ 2(1 -u-e) 2(-u - e) 

and g = 2/(1 — e)^ on the triangle of its support. Pictures of the MSLE and MLE are given in 
Figures 1 and 2, where we took 

Foix) = l-{l-xf,xe[0,l]. 

Note that the density /q vanishes at the right boundary point, so, strictly speaking, the conditions 
of Theorem 4.1 are not satisfied (indicating that these conditions are probably slightly stronger 
than needed). The triweight kernel was used as smoothing kernel and the bandwidth was simply 
bn = n~^'^. Boundary kernels were used in computing /i„i and hn2, and for hn we just used 
the product of the kernels, again using a boundary correction using boundary kernels in each 
component, with bandwidth bn = n~^'^ for both coordinates. The separating gap e was chosen to 
be £ = 0.1. Figures 1 and 2 show the rather large improvement of the MSLE over the MLE when 
the underlying distribution is smooth. 

In [8] an asymptotic normality result, analogous to Theorem 4.1 was proved for the current status 
model. In this case it is much easier to prove a result of this type, since we have an explicit charac- 
terization of the MSLE as the slope of the greatest convex minorant of a continuous cusum diagram. 
Moreover it was shown that the MSLE was asymptotically equivalent to a plug-in estimator, for 
which the asymptotic normality result could be easily proved. For the interval censoring, case 2, 
model, we have neither of these properties: the MSLE has to be computed by an iterative proce- 
dure, there is not an obvious characterization as a slope of a greatest convex minorant and there 
is no simple asymptotically equivalent plug-in estimator. Instead we have to rely on the implicit 
characterization, given in Lemma 2.1 of section 2. Furthermore, the role of the plug-in estimator is 
now played by the solution of a non-linear integral equation, see Lemmas 4.1 and 4.3. An implicit 
function theorem in Banach spaces is the basic tool for showing the existence and uniqueness of 
the solution to this equation, where the parameters are given by the estimates hnj and hn of h^j 
and /lo, respectively. 

In this paper we concentrate on the "separated case", where Ui — Ti > e for some e > 0, as in 
[2] and [3]. This case seems to be the most important case, and also to be the usual situation in 
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Fig 1. The MSLE (solid) and MLE (dashed) on [0, 1] for a sample of size n = 100 from the distribution function 
Fo{x) = 1 — (1 — x)^ (dotted); g is uniform on the triangle with vertices (0,£), (0, 1) and (1 — e, 1), where e = 0.1. 
The bandwidth for the computation of the MSLE was h„ = n~^'^ ~ 0.398107. 



Fig 2. The MSLE (solid) and MLE (dashed) on [0, 1] for a sample of size n — 1000 from the distribution function 
Fo{x) = 1 — (1 — x)^ (dotted); g is uniform on the triangle with vertices (0,£), (0, 1) and (1 — e, 1), where £ = 0.1. 
The bandwidth for the computation of the MSLE was b„ — n^^''' ~ 0.251189. 
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medical statistics. The non-separated case is rather different and has its own specific difficulties. The 
behavior of the MLE and SMLE in this situation is discussed in [4], [6] and [10], but the theory is 
still rather incomplete, even for the MLE. There is a conjecture for its asymptotic distribution, put 
forward in [5] and [11], but this conjecture has not been proved up till now, although a simulation 
study, supporting the conjecture is given in [10]. The theory for the MSLE in this situation has still 
not been developed. 

2. Characterization of the MSLE and consistency. Let, for an estimate h^ of /iq, satis- 
fying 

(2.1) hn{t,u) = 0, u-t < e, 

for some e > 0, the nabla function Vi? be defined by: 

^ ' ' F{u) l-F{u) y,=o^(«)-^W J,=u Fiv) - F{u) 

if < F{u) < 1. If F{u) = or F{u) = 1, we define Vf{u) = 0. 

Then, similarly to the ordinary MLE, the MSLE can be characterized by the so-called Fenchel 
duality conditions. 

Lemma 2.1. Let hn satisfy (2.1), for some e > 0, and let a and b be defined as in Condition 1.2 
(see (1.10) and (1.11)). Then the distribution function Fn maximizes (1.7) if and only if Fn{t) = 
0, t < a, Fn{t) = 1, t > b, and F„ is continuous on [0,M) and satisfies the conditions 



(2.3) / Vp^{v)dv<0,te[a,b), 

J v=t 

and 

(2.4) I Vp^{v)F4v)dv = 0. 

•J a 

Moreover, if t £ [0,M) is a point of increase of Fn, i.e., 

Fn{u) - Fn{u') > 0, for all u, u' G [0, M] such that u' < t < u, ,ift>0, 



^^■^^ ' Fn{u)>0, ue{0,M] , ift = 0, 

we have: 

(2.6) V^^(t) = and / Vpjv)dv = 0. 

Proof. It is clear that if Fn gives mass to [0, a) or [b, M], we can improve 1{F) by displacing mass 
to the interval [a, b). Therefore, we may assume that Fnit) = 0, t < a, F„(t) = 1, t > 6. 

Now first suppose that the conditions (2.3) and (2.4) are satisfied. Then we cannot have Fn{t) = 
for t in an interval where hni{t) > or ji^^r.hn{u,t) > 0, since otherwise i{Fn) = — oo. Similarly, 
we cannot have Fnit) = 1 for t in an interval where hn2{t) > or J^^^ hn{t, u) > 0. 

Since the criterion function F h^ (-{F) is concave in F, we get: 

pM pb 

(2.7) eiF) - i{Fn) < / VpJu){F{u) - Fniu)} du = / V p^(u){F{u) - F„(n)} du, 

Jo J a 
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where we use the facts that the mtegrals defining £{F) are all non-positive and that V^ (t) = 0, if 
t ^ [cL,b). Note that this is similar to the relation (1.11) in [11]. 
By (2.4), 



f VpJu)Fniu)du = 0, 

J a 



and hence: 



VpJu){F{u)-Fn{u)}du= / VpJu)F{u)du. 

J a 

If F = l[i,oo) for some t £ [a, b), we get by (2.3), 

/ Vp^{u)F{u) du= V^^^(u) du < 0. 
So we also get, for distribution functions of the type 



k 



F = ^ail[t^,oo), a<ti<...,tk<b,ai£{0,l),'^ai< 1, 
that 

rb '' pb 

/ V^^^ {u)F{u) du = ^ai / V^^ {u) du < 0. 

Ja j^_^ Jti 

Since we can approximate any distribution function F on [a, b] by distribution functions of this 
type, this implies £{F) < i{Fn), for all distribution functions F. 

Conversely, suppose that Fn maximizes i{F). Then we must have, if t E (a, b) and F = l[i^oo)i 

/ VpJv)dv = l[m6~^e(Fn + SF)-i{Fn)} < 0, 

J v=t "^^ 

(using the concavity of I for the existence of the limit), and hence (2.3) has to be satisfied for F^. 
Moreover, defining Fs by 

F5(t) = (l + 5)F„(t)Al, tG[0,M], 

we find: 

s^o 5 

since the limit has to be nonpositive, if we let 5 tend to zero, either from above or from below. 
We have: 

b 



lim = / V p {u)Fn{u)du, 



so (2.4) must hold. 

Suppose Fn has a jump at t G (a, b) and suppose V^ (t— ) > 0. Define 

F„(n), u<t-5, 
Fs{u) = { Fn{t), u£[t-5,t) 
Fn{u), U£[t,M]. 
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Then 

f V pJu){Fsiu) - Fniu)} du > 0, 
for small 6 > 0, a contradiction. Hence we must have: V^, (t— ) < 0. If V^ (t) < 0, we define 

{Fn{u), U<t, 

Fn{t-), U£[t,t + d) 
Fn{u), UG[t + 6,M], 

and then again: 

j V p^{u){Fs{u) - Fn{u)} du > 0, 

for small 6 > 0, a contradiction, so we must have: V p (t) > 0, implying we must have 
(2.8) VpJt-)<0<VpJt). 

On the other hand, we have by the continuity of hnj, j = 1, 2, and /i„ and the definition of a and 
b: 

Kl{t) hnl{t) hn2{t) , hn2{t) 



Vp (t)- Vp (t-) 

^" ^" Fn{t) Fn{t-) l-Fn{t) 1 - F„(t-) 



v=0 Fn{t) - Fn{v) Jv=0 Fn{t-) - Fn{v) 

v=t Fn{v) - Fnit) Jv=t Fn{v) - Fn^t-) 

contradicting (2.8). The conclusion is that we must have: Fn{t—) = F„(t), for t S (a, 6). 

If i = a, we either have: a = 0, in which case Fn is continuous at a by right-continuity, or we 
have Fn{t) = 0, t < a. Then, if Fn would have a jump at a, we define the perturbation 



' 0, u < a, 

F„(a)/2, u€[a,a + 6) 
^ Fn{u), ue [a + 6,M], 



F{u) = < 
Then we get, using condition (1.13) 

" 2{1 - F„(a)} ■ '' '"'"' Jv=t Fn{v) - Fn{t) 



Vp{a){Fs{a) - F„(a)} = — ^— - + -^Fn{a) / -^— ^— - dv > 

211 - Fnia) \ Jv=t FJv) - FJt) 



Hence 

limr^MF,)-£(F„)| = ^"^(°)^"^("^ +lF„(a) T , ^"^^'^.^ d. > 0, 
•^^o I' ' ' '/ 2{1-F„(a)} 2 ^ V.=tF„(z;)-F„(i) 

contradicting the assumption that F^ maximizes (.{F). So we must have Fn{o) = 0. Similarly, we 
either have & = M or we have b < M and Fn{b—) = 1. 
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Finally, suppose (2.5) is satisfied for a point t G ia,b) and suppose V^ (t) > 0. Then, by 
the continuity of F„, there also exists a neighborhood of t such that V^ (u) > for u in this 
neighborhood. We now define a perturbation Fs of F„ by: 

Fn{u), U<t-S, 

F5{u) = { Fn{t + S), u£[t-6,t + 6) 
Fn{u), ue[t + 6,M]. 



Then we have for sufficiently small 5 > 0: 

I V pJu){Fs{u) - Fniu)} du > 0, 
contradicting 

f V pJu){Fs{u) - Fniu)} du < 0. 

If (2.5) is satisfied for a point t S (a, b) and V p (t) < 0, we define the perturbation Fs of F„ by: 

{Fn{u), U<t-S, 

Fn{t-6), u£[t-6,t + 6) 
Fn{u), u^[t + 5,M]. 

and get a contradiction in the same way. So, if (2.5) is satisfied for a point t G (a, 6), we must have: 

v^„(t) = o. 

If a is a point of increase (which can only happen if Fn{u) > for -u > a), we argue in the same way, 
taking a right neighborhood of a. Since Fn cannot have points of increase t such that < t < a or 
b < t < M, this proves the left-hand side of (2.6). 
Furthermore, t E [a,b) can only satisfy (2.5) if 



nb 



rb 



b 



r-b rb 



no po " po pO 

/ VpJu)Fniu)du= -Fn{t) j Vpju)du + / VpJu)dudFnit) 

J a L J u=t \ t=a J t=a J u=t 

rb rb rb 

= Fn{a) Vpju)du+ / V^^ (u) du dPn (t) 

J u=a Jt=a J u=t 

rb rb 

(2.9) = / / Vp^{u)dudFn{t), 

Jt=a J u=t 

using the second equality in (2.4), implying by (2.3) that 

cb 

V p (u) du = 0, 

lu=t 

for points t satisfying (2.5), since otherwise the right-hand side of (2.9) would be strictly negative. D 

Note that if Vir(t) = for all t G (a, b), where Vj? is defined by (2.2), the conditions of Lemma 
2.1 are satisfied for F, and hence F would be the MSLE if it also would be a distribution function. 
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But unfortunately, the function F satisfying Vi?(t) = for all t G (a, 6), although usually uniquely 
determined, need not be monotone. We will call a function Fn, satisfying V^ (t) = 0, t € (a, 6), a 
plug-in estimator or naive estimator (as in [8]). This plug-in estimator will be further studied in 
section 4 in the proof of the local asymptotic normality of the MSLE, where it will be shown that 
the MSLE is indeed locally asymptotically equivalent to this plug-in estimator. 

Corollary 2.1. Let hn satisfy (2.1), for some e > 0, and let a and b be defined as in Lemma 
2.1. Then the distribution function Fn maximizes (1.7) if and only if Fn{t) = 0, t < a, Fn{t) = 1, 
t > b, and Fn is continuous on [0, M) and satisfies the conditions 

(2.10) VpJv)dv>0,tG{a,b), 

J a 

and 

(2.11) f Vpjv)dv = 0. 

J a 

Moreover, if t € [0,M) is a point of increase of Fn, i.e., satisfies condition (2.5) of Lemma 2.1, 
then 

(2.12) V^^(t)=0 and i Vpjv)dv = 0. 

J a 

Proof. Suppose -F„ maximizes i{F). Defining 

Fs{t) = {1 - (1 + 5)(1 - F{t))} V 0, t G [0, M], 
we find: 

(2.13) lim ^(-^'?) - ^(-^») = f V^ {u)\l - Fn{u)]du = 0. 

5^0 5 J a 

So if Fn maximizes ^{F), (2.11) follows from (2.13) and (2.4) of Lemma 2.1. 

rb 

Vp^ (u) du = 0. 

J a 

This implies: 

rt rb 

Vpjv)dv = - / Vp^{v)dv, 

and conditions (2.4) and (2.6) now also follow. Conversely, if the conditions of the corollary hold, 
we get: 

rb rb rb 



po po po pu 

/ Fniu)V pju) du = Fn{b) Vpjv)dv+ / V pjv) dv dFn{u) 

J a J a J t=a J v=a 

= Fn{b) I Vp^{v)dv = 0, 

J a 

implying condition (2.4) of Lemma 2.1. The other conditions of 2.1 follow similarly. D 
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We now simplify the conditions somewhat, in view of the iterative convex minorant agorithm, 
to be discussed in section 3. Multiplying Vp by F(l — F) yields the function 

Vf(.u) = hnl{u){l - F{u)} - hn2{u)F{u) 

™ hn{v,u) ^ /-^^ K{U,V) 



(2.14) + F{u){l - F{u)} r l^f'^t! , dv- [ 

Jv=o F[u) - F{v) Jy-_ 



._,F{v)-F{u) 



dvy 



Corollary 2.2. Let hn satisfy (2.1), for some e > and let the function Vp be defined by 
(2.14)- Then the distribution function Fn maximizes (1.7) if and only if Fn{t) = 0, t < a, Fn{t) = 1, 
t > b, and Fn is continuous on [0, M) and satisfies the conditions 



(2.15) / Vpjv)dv >0,te [0,M], 



and 

rM _ 

(2.16) J Vp^{v)dv = 0. 

Moreover, if t G [0,M) is a point of increase of Fn, i.e., satisfies condition (2.5) of Lemma 2.1, 
then 

(2.17) V^(t)=0 and Vp{v)dv = 0. 

Jo " 

Proof. Let a and b be defined as in Corollary 2.1. We then have, for t G [a, 6), 

[ Vpjv) dv= [ Fn{u){l - Fn{u)}Vp^{v) dv 
Jo " Ja 

We have: 

r-t 

Fn{u){l - Fn{u)}V p^{v) dv 

= Fn{t){l - Fn{t)} / Vpju) du- {I- 2Fn{u)} / V pjv) dv dFn{ 

J u=a Ja Jv=a 

= Fnit){l- Fn{t)} I Vp^iu) du. 
J u=a 

Hence condition (2.15) is equivalent to condition (2.10) of Corollary 2.1. Similarly, by Corollary 
2.1, condition (2.15) can be written 

M i-b r-b 

Vp^^iv) dv= F„(n){l - Fn{u)}Vp^{v) dv = F„(6){1 - F„(6)} / V pju) du, 

Ja Ju=a 

and is therefore equivalent to condition (2.10) of Corollary 2.1. D 

We now prove consistency of the MSLE. The proof is somewhat analogous to the proof of the 
consistency of the MLE in [11]. 



u 
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Theorem 2.1. (Consistency of the MSLE) Let Condition 1.1 he satisfied on [0,M] for 
the distribution function Fq and the observation density g. Moreover, let hnj and hn be kernel 
estimators of hoj and ho, respectively, of the type defined in Condition 1.1. Finally, let Fn be the 
MSLE of Fq. Then, with probability one, 

lim Fn{t) = Fo{t), 

n— >oo 

for each t £ [0,M). The convergence is uniform on each subinterval [a,b] of (0,M). 

Proof. By the assumption on the kernel estimates and the observation density g, we may assume 
that hn satisfies (2.1), for some e > 0, and ah large n. Let the function ip be defined by 

(2.18) 

4^{F;hi,h2,h)= I hi{t) log F{t) dt + h2{t) log{l - F (t)} dt + h{t,u)log{F{u) - F{t)} dtdu, 

Then we must have, if hj = hnj, j = 1, 2, and h = hn, 

lime-i {^'((l - e)Fn + eFo; hi, h2, h) - i;{Fn; /ii, /i2, /i)} < 0. 
This implies (see also (4.20) in [11]): 

(2.19) [§^;hnl{t)dt+ ['-^K2{t)dt+ [§^^Ut,u)dtdu<l. 

J tn\t) J I — tn\t) J tn\U) — tn[t) 

Fix a small 5 G [0, M/2] and let the intervals As and Bs be defined by 

Ai = [5,M], Bs = [Q,M-5]. 

Moreover, if {{t, u) : g{t, u) > 0} = {(t, n) : u — t > e], let Cs be defined by 

C5 = {{t,u):t< u, d{{t, u), {t, 7(i)) > 6}, 

where 7 is the function defining the boundary of the support of g, as in Condition 1.1, and d denotes 
the Euclidean distance. Then, arguing as in [11], Part II, Chapter 4, we find that there exists an 
M > such that for all n, 

sup 1/Fn{t) + sup 1/{1 - Fn{t)} < M. 
t&As t&Bg 

We also cannot have that Fn{uk„) — Fn{tk„) — ?• 0, for a sequence of points {tk„-,Ukn) G Cs. For 
suppose, if necessary by taking a subsequence, that t^^ — )• to and Uk^ — )• uq. The UQ — tQ>e + 5. By 
the vague convergence of Fn to F, there are continuity points f 1 and ui such that to < ^i < ^1 < ^O) 
til — ti > ^(^ + e, and Fn{ti) — >• F{ti) and F„(ni) — )■ F{ui). Moreover, since 

Fn{ui) - F„(ti) < F„(nfcJ - F„(tfcJ, 

for large n, we must have: F{ui) — F{ti) = 0. We then would get that there exists a rectangle 
[ti,t2] X [u2,ui] such that U2 — t2 > e and 



liminf / — hn{t,u) dtdu 

'^^°° J\hM]x\u,.,u,] FJu)-Fn(t) 



> K{Fo{u2) - Fo{t2)} I ho{t,u)dtdu, 

J[tl,t2]x[u2,Ui] 
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for any K > 0, contradicting (2.19). So we may also assume that 

inf {F„(n)-F„(t)}>-^, 

for all n. 

As in [11], Part II, Chapter 4, we have by the Helly compactness theorem that the exists a set of 
probabihty one, such that for each w in this set the sequence (F„(-; w)) has a subsequence F„j.(-; <^) 
which converges vaguely to a subdistribution function F = F(-;uj). By the vague convergence of 
Fn^{-;uj) to F we now get: 



Af, Fn^ (t; cj) Jbs 1 - Fnk {t; co) Jcs Fn^ [u] w) - F„^, (t; w 

^hoi{t)dt+ f \ll^h,,{t)dt+ I ^!f\^3(^hoit,u)dtdu,n^oo. 
As -^(^) is, 1 - P^it) Jcs F{u) - F{t) 

By monotone convergence we now also have: 

/ ^ho,it)dt+[ \^ho,it)dt+[ J;\-lfh oit,u)dtdu 
J[o,M] ^(*) J[OM] 1 - ^(*) J[o,M? ^W - ^(*) 

(2.20) 

Sio [J As F{t) JB,l-F{t) Jc, F{u)-F{t) J 

Suppose F{t) / i^o(^) for some t G [0,M/2]. Then there exist a n G (t,M) such that ho{t,u) > 
and 

Fojt)' {l-Foju)}' {Foju) - Fojt)}' 
F{t) l-F(n) F{u)-F{t) ' 

since 

i^ ^ {l-Fo(n)}^ ^ {Fo(n) - Fo(t)}2 / = 1, Fo{t) = x, Fo(n) = y, 



X 1 — y y — x \ > 1, otherwise. 

(see also (4.27) in [11]). By the continuity of Fq and the monotonicity and right continuity of F 
there exist therefore also h > such that 

Fojtr , {l-Fo(nOP {Foju') - Fojt')}^ 

F{t') l-F(u') F{u')-F{t') ' 

if t' G [t, t + /i] and u' G [u, u + /i]. This implies 

^'^^hoiit)dt+ f l-lMlho2it)dt+ f ^'^;l~ifh oit,u)dtdn 



[0,M] F{t) J[0,M] 1 - ^(*) ^[0,Af]2 i^(n) - Fit) 

f Foitf ,,, /■ {l-Fo(t)}2 ,,, /■ {Fo(n) - Fo(t)}2 , ,, , 

y[0,Af] ^(*) 7[0,Af] 1 - F{t) J[o^M]'^ F{u) - F{t) 

Fojtr {l-Fojt)}' {F,{u)-Fo{t)}' ] 

-^ ^ Ev^N ^ ^^r~\ Ev^^ — r ^(i, -") at du > 1, 



/[o,A/p I F{t) 1 - F{t) F{u) - F{t) 

in contradiction with (2.20). So we must have F{t) = -Fo(^) if i G [0,M/2]. A similar argument 
yields F{t) = Foit) if t G [M/2, M). 

So, for each w outside a set of probability zero, the sequence (F„(-; w)) has a subsequence which 
converges weakly to Fq. This implies that Fn{t) converges almost surely to Fo{t) for each t G [0, M). 
The uniformity of the convergence on subintervals follows from the continuity of -Fq- D 
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3. Algorithms. We start by recalling how the MSLE is computed for current status data. As 
explained in [8], the MSLE can be computed by first constructing the continuous cusum diagram 



(3.1) (^Gn{t),Hn{t)j,te[0,M], 
defining the densities Qn and /i„ on [0, M] by 

^ n 1 '^ 

(3.2) gn{t) = -Y,Kb^it-Ti), hn{t) = -Y,KbA^-T,)l^i, 
and Gn and Hn by: 

Gn{t) = I gniu)du, Hn{t) = I hn{u) du, t £ [0,M], 

Jo Jo 

where 

\ = 1{X,<T,}, 

noting that for the current status model we just have the observation times Ti and the information 
whether the unobervable event of interest Xi is to the left or to the right of Tj. 

Then the slope of the greatest convex minorant of the cusum diagram (3.1) defines the MSLE 
Fn and F„ maximizes the smoothed log likelihood 

I'M r-M 

/ hn{t) log F it) dt+ {gnit)-hnit)}log{l-F{t)}dt. 

Jo Jo 

One can improve the estimates (3.2) by using boundary kernels, but this is not relevant here, the 
main point is that the MSLE can in this case be computed by a one-step algorithm which just 
computes the greatest convex minorant of the cusum diagram (3.1). This parallels the situation 
for the MLE in the current status model, which can also be computed by constructing a cusum 
diagram and the corresponding convex minorant. Such a simple one-step algorithm does no longer 
exist for the computation of the MLE for the interval censoring, case 2, model, and the situation 
is similar for the MSLE: we have to use some iterative procedure. 

The EM algorithm is based on the following "self-consistency equations" 

\jt F„{v) Jo 1 - ^(t.) Jv<t<u F„(u) - F„(») J ' " 

where fn{t) = Fni't)- This yields the iteration steps 

(3.3) 

f^'^'Ht) = ir^^dv+ r^4^^-+ / ^J:^"^"Lr. dvdu]f(^\t). 

^' \Jt F(k){v) J^i-F(k){v) y.<«„FW(n)-FW(^) J^ ^' 

One can indeed use a discretized version of (3.3) to compute the MSLE, but the EM algorithm is 
(as is usual for this type of problem with many parameters) very slow. Simply enhancing the EM 
algorithm by a Newton step is also not helpful because of the many constraints the solution has 
to satisfy, leading to very small "feasible steps" . For this reason a Newton-improved EM algorithm 
does not improve very much on the EM algorithm itself. 
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An algorithm that we found very stable and rather fast is the interior point method with a 
logarithmic barrier function, which keeps the path to the solution away from the boundary. In this 
case one maximizes the criterion function 

m m 

(l,^{F) = Y,{Ki{ti)\ogF{U)]d, + Y,{hn2{ti)\og{l- F{U)]]di 

m— 1 m m+l 

(3.4) + 2Z IZ {hn{ti,tj)\og{F{tj) - F{U)]]didJ+^^Y, ^oe{F{U) - F{U^i)] , 

instead of (1.8), where F{to) = and F{tm-\-i) = 1, and lets the parameter fi tend to zero from 
above after completing an interior iteration loop, maximizing (3.4) for a fixed /x. In the interior 
iteration loop one uses the Hessian matrix, corresponding to (3.4), and solves the maximization 
problem by doing Newton- Raphson steps in maximizing (3.4). The iterations are started with n = 1 
and n is halved after completing an interior iteration after which (3.4) is maximized again, starting 
with the value of F, obtained in the preceding inner iteration. 

In our experience the fastest algorithm, however, is a combination of the EM algorithm with a 
version of the iterative convex minorant (ICM) algorithm, introduced in [5] and [11]. This procedure 
has the added advantage that one does not have to solve a rather large matrix equation, using the 
Hessian matrix, as is needed in the interior point method. Instead of the cusum diagram (3.1) we 
use a sequence of cusum diagrams 

(3.5) (wPit),VJ,''\t)^,t€[0,M],k = 0,1,2,..., 

for which we compute the greatest convex minorants at each A;th step. We alternate this with an 
EM-step (the combination is sometimes called the "hybrid algorithm"). The cumulative weight 

(k) 

function Wn is of the form 

WP = I wi^\u) du, t > 0, 
Jo 

(k) (k) 

for suitably (but somewhat arbitrarily) chosen weights Wn , and the cusum function Vn is oi the 
form: 

vP{t)= [ F'^''\u)wi^\u) du + [ Vpi,){u)du,t>0, 
Jo Jo 

where, for a distribution function F, \/p is the function, defined by (2.14), evaluated at -F = F^ K 
The idea is that the iterations force the Fenchel duality conditions (2.15) and (2.16) to be satisfied 
at the end of the iterations. 

The following weight function, chosen by taking the diagonal elements of the Hessian matrix, 
corresponding to the function Vp, gave good convergence results in our simulation study of the 
MSLE: 

- {1 - 2FW(t)} I r ,,,,^"^"'1, , dn - r J''}''''],,,, du] 

/■* hn{u,t) f^' hn{t,u) 

To prevent divergence of the algorithm, Armijo's line search method, as implemented in [12], was 
used for determining the step size at each iteration. 
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Note that, in the case of current status data, the function Vp is just given by: 

(3.6) Vf = hn{t) - ~9n{t)F{t) , 

and that we can compute F^ in one step this case, using the cusum diagram (3.1). 

4. Asymptotic distribution. In this section we wih prove the following theorem. 

Theorem 4.1. Let condition 1.1 he satisfied. Moreover, let Fq he twice dijjerentiable, with a 
hounded continuous derivative /o on the interior of [0,M], which is hounded away from zero on 
[0,M], with a finite positive right limit at and a positive left limit at M. Also, let /o have a 
bounded continuous derivative on (0, M) and let gi and g2 he twice differentiahle on the interior 
of their supports Si and S2, respectively, and let giF^ + (72(1 — -^0) stf^y away from zero on [0, M]. 
Furthermore, let the joint density g of the pair of observation times {Ti, Ui) have a bounded (total) 
second derivative on the interior of its support S, having finite limits approaching the boundary of 
S. Suppose that Xi is independent of {Ti., Ui), and let dp^ he defined by 

, I ^ Fo{v){l-Fo{v)} 



gi{v){l - Fo{v)} + Fo{v)g2{v) ' 
Then, if bn ^ n^^'^, we have for for each v G (0, M), 

V^ |f„(^) - Fo(t;) - ^^ } A iV (0, a(.)2) 



where 



'^^ ' gi{v){l - Fo{v)} + Fo{v)g2(v) ' ^ ' 



, , w^ho(t,v) r^^ ^ho(v,u) , , ., 

(4.1) +dFoiv){ Jf, I' , dt- / Jf.^L' du) / u'K{u)du, 

and 

(4.2) .,(.) = i+dp»{/ ^,f°l,,, it+f ^ ."';•'■'> , d„. 

lJt<v Fo{v) - Fo{t) y^>„ Fo[w) - Fo{v) 

and where N (0, 0"(w)^) is a normal distribution with first moment zero and variance cr{v)'^ , defined 
by 

2 dpoiv) f „, .2 



(4.3) a(vy = -^^ / KiuYdu. 

(yi{v) J 

Before embarking on the proof, we motivate the approach taken. Our starting point is given by 
the duahty conditions (2.15) and (2.16). It is clear that if we would have equality in (2.15) instead 
of inequality, we would get the following relation by differentiating w.r.t. t: 

Vpit) = hnlit){l - F{t)} - hn2{t)F{t) 
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Conversely, if F solves (4.4) for each t G (0, M) and F is a distribution function, F also satisfies 
(2.15) and (2.16) and is therefore the MSLE. 

Replacing hnj by /loj, j = 1,2, and hn by /iq, we obtain the equation 



' ^°(^'^) dv-T ^i^^du)=,, 



hoi{t){l - Fit)} - ho2{t)Fit) + F{t){l - Fit)} \ [ 

[Jv=o F^[t) - F^iv) Ju=t Fiu) - Fit) 

which, using the definition of /iqj and ho, turns into 
9i(t)F„(t){l - F(t)) - 92(t){l - F„(t)}F(t) 
+ F(t){l - Fm i f !I(''-'»FM-F,(u)} ^^ _ r" slt.utmu) F,(t) ^^ 



v=0 



Fit) - Fiv) A=, Fiu) - Fit) 



This equation is clearly solved by Fq itself. 
This motivates us to consider the equation 

cPit;hi,h2,h,F) = 0, t E [0,M], 

where 

(/.(t; /ii, /i2, h, F) = h,it){l - Fit)} - h2it)Fit) 

The functions h belong to a closed subset of the Banach space CiS), where S is the support of g, 
see Condition 1.1. By Condition 1.1, S is contained in the set 



{(t,n) -.u-tye} 



for a fixed e > 0. 

We now define the space E by 



(4.6) E = C[Si] X C[S2] X CiS) x C[0, M], 

where Sj if the support of gj (see Condition 1.1), and put the following norm on E: 

(4.7) ||(/ii,/i2,/i,F)||=max{||/ii||,||/i2||,||/i||,||F||}, 

where the norms on the right-hand side denote the supremum norm, which we also denote by || • ||. 
Note that i? is a Banach space for the norm (4.7). 
We will also need another norm on C[S], defined by 

(4.8) \\h\\s= sup \[ \hiu,t)\du+ f \hit,u)\du\. 

te[0,A/] yJu:{u,t)eS Ju:{t,u)eS J 

Note that this is indeed a norm on C[S], since \\h\\s = implies h = and since the triangle 
inequality and homogeneity property for scalars are obviously satisfied. 

Lemma 4.1. Let Fq, hoi, ho2 and ho satisfy the conditions of Theorem, 4-1- Furthermore, let the 
function (p be defined by (4-5). Then there exists for all small t] > an open set U in the Banach 
space C[Si] x C[S2] x CiS), endowed with the norm 

||(/ii,/i2,/i)|| = max{||/ii||, ||/i2||,||/i||}, 
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such that, if {hi,h2,h) G U, the equation 

(t){t-hiM,KF) = 0, tG [0,M], 

where 4> is defined by (4-5), has a unique solution F in the open ball B{Fo,rj) C C[0,M] with 
midpoint Fq. 

Proof. To avoid messy notation, we extend the definition of h to [0, M]^, by defining h{t, u) = 0, if 
(t, u) ^ S and hj{t) = 0, if i ^ Sj. We will now use the line of argument of the proof of the implicit 
function theorem 10.2.1 in [1]. We define the function cj) by: 

(4.9) [Hhi,h2, h, F)] (t) = <p{t, hi, /i2, h,F),te [0, M], 

so (/) maps E to C[0, M]. The derivative of (f) w.r.t. F, is given by the function 
[[di4^{hiM,h,F)\{A)\{t) 
tf - (,.,«, . ..,), Ait) . (1 - 2f «,} {£ ^|M_ ,„ _ £ _MM_ ,„} ,„ 

\A.o {F(t)-F(a)}2 j„., {F{u)-F(t)Y' J' 

where A G C[0, M]. Note that the right-hand side is well-defined for F G B{Fq, j]) and small ij > 0, 
since /i(t, u) = if u — t < e, and since Fq has a non- vanishing derivative on [0, M], implying that 
Fq{u) — FQ{t) stays away from zero if u — t > e. 

We now define the open set U = Us of functions {hi, /i2, h) by 

(4.10) Us = {ihi,h2,h)eC[Si] xC[S2] x C[S] : max{||/ii - /ioi||, ||/i2 - /i02||, ||/i - /iolls} < ^^l • 
There exists a 6 > such that for (hi, h2,h) G Us, 

p(/ii, h2, h, Fi) - 4>{hi,h2, h, F2) - [d4^{hoi,ho2, ho, Fq)] (Fi - Fs)]] < e' \\Fi - F2II , 

if Fi,F2 G B{FQ,r]), where e' > can be made arbitrarily small by making 5 small. 
The equation 

[di4>{Fii;hiii,hii2,ho,Fa)]{A) = Q 

only has the trivial solution A = in C[0,M]. This is seen in the following way. Suppose there 
exists a solution in ^4 G C[0, M] such that \di4>{Fo; hoi, /102, ho, Fq)] (A) = and A{s) > for some 
s G [0, M]. Then also inaXg^[o,M] A{s) > 0. Suppose the maximum is attained at i G [0, M]. Then: 

[[d4^{hoi,ho2,ho,Fo)]{A)]{t) 

= - {hoiit) + ho2m A{t) + {1 - 2Fo{t)} \ f y^'^\ dv- r l^f'^'l du]A{t) 

pm/1 Pmi/r ho{u,t){A(t) - A{u)} f'^ hoit,u){Ait)-Aiu)} \ 

- Fomi - Foit)} ^l^^ {Foit)-Foiu)r ^^ + L {Foiu)-Foit)V '""j 

= - {gi{t){l - Fo{t)} + 52(t)Fo(t)} A{t) 

pm/i pmi/r ho{u,t){A{t) - A{u)} r'' ho{t,u){A{t)-A{u)} \ 

- Fomi - Foit)} ^l^^ {Foit)-Foiu)V '^ + L {Foiu)-Foit)V '""j 

< - {gi{t){l - Foit)} + 52(t)Fo(t)} A{t) < 0, 
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using gi{t){l — -Fb(i)} + g2it)Fo{t) > 0, in contradiction witli the assumption 

[d4^{hoi,ho2,ho,Fo)]{A) = 0. 

We similarly get a contradiction if we assume that A(t) < for some t G [0, Af ] (similar arguments 
were used for the integral equation, studied in [2]). This shows that di(j){hoi, ho2, ho, F) is a linear 
homeomorphism of C[0, M] onto C[0, M] and that we can in fact use arguments of the type used in 
the proof of the implicit function theorem in Banach spaces, as given, for example in [1], Theorem 
10.2.1. 

Denoting (as in the proof of Theorem 10.2.1 of [1]) the linear mapping d4,(j){hoi, ho2, ho, Fq) by 
To and its inverse by TJ~ , we find that 

\\T^^ ■ {^{h,h2,h,Fi) - ^{hi,h2,h,F2)} - {F^ - F2)\\ < e'\\T^^\\\\Fi - F2\\ < ^||Fi-F2||, 

so we have a contraction, and this implies that the equation 

F = F-To^-^{;F;hi,h2,h) 

has a unique solution F G B{Fo,r]) which can be obtained by successive approximations, if we take 
the balls around hoj and ho, to which hj and h belong, respectively, sufficiently small, using a result 
like 10.1.1 in [1]. This, in turn, implies that the equation 

^{hi,h2,h,F) = 

has a unique solution in F G B{Fo, rf), for {hi, /i2, h) G Us and small 5. D 

Having established the existence of a solution, we now consider the derivative of the solution. 

Lemma 4.2. Let, under the conditions of Lemma 4-1, for a small rj > 0, F £ B{Fo,i]) be the 
solution of 

(t>{t;hi,h2,h,F) = 0, tG [0,M], 

where (p is defined by (4-5), and where hj has a bounded continuous derivative on the interior of 
Sj, having finite limits approaching the boundary of Sj, for j = 1,2. Similarly, we suppose that h 
is difjerentiable on the interior of its support S and has finite limits approaching the boundary of 
S. Then, if {hi,h2,h) G Us, where Us is defined by (4-10), the solution F has a continuous and 
bounded derivative for sufficiently small rj and 6. 

Proof. If 0(-; hi, /12, h,F) = we have: 

{l-Fit)}hiit)lsdt)-F{t)h2it)lsAt) 



+ F(t){l-F(t)}|£ 



^^^'*) du-f -^^^^^^du\=0. 



(n,t)GS Fit) - Fiu) Ju:(t,u)^S ^W " ^W 
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Differentiation w.r.t. t, and defining / = F' , yields: 

{1 - F{t)]h',{t)lsAt) - F{t)h',{t) - f{t){h^{t)ls,{t) + h2{t)ls,{t)} 

h{u,t) f h{t,u) 

,,,)e5 F{t) - F{u) "" ~ L(i,„)e5 F{u) - F{t) 



+ {1 - 2F{t)}m I [ ^.^^^'!^. . du - I -J'^^m^ du 



+ Fit){l-Fit)}\ f 



d,h{u,t) ^^_ r dMt,n),^ 



■.{u,t)es Fit) - F{u) Ju:it,u)^s F{u) - F{t) 

(4.11) 

Temporarily replacing F hy Fq, and hj and h by /iqj and /iq, respectively, we would obtain: 

{1 - Fo{t)}h'oAt)lsAt) - Fo{t)h'o2{t)ls,{t) - f{t){{l - Fo{t)}gi{t)lsAt) + Fo{t)g2{t)ls,{t)} 

I T?/^\[t T? un I f d2ho{u,t) f dihoit,u) \ 

+ Fo t){l-Fo t }< / -— —--du- -—— ^^Y^du\ 

- mFo{t){l - Foit)} I [ ,j,.'!fX\ U2 '^^+/ JFfy'wm2 ^4=°- 

[Ju:iu,t)es {Fo{t) - Fo{u)y Ju:{t,u)&s i^oW - Fo{t)y J 

that is: 
fit) I {{1 - Fo{t)}gi{t)ls,{t) + Fo{t)g2{t)ls,{t)} 



+ Foit){l Foit)} ^l^^^^^^^ l^^(^) _ ^^(^)p -- . y^^^^^^^^^ ^^^(^) _ ^^(^)p 



^°^^''^ dn+ /■ ._^°(^'!L.. ^n 



{1 - Fo(t)}/i'oi(t)ls,(t) - Fo(t)/i'o2(t)ls,(t) 



d2ho{u,t) j^ /■ diho{t,u) 



:(n,t)e5 -^o(i) - i^o(n) Ju:{t,u)es Fo{u) - Fo{t) 

This means that the coefficient of f{t) in equation (4.11) stays away from zero if F belongs to 
a sufficiently small ball B(Fo,r]) around Fq, and (/ii,/i2,/i) G f^(5 for small 6 > 0. Denoting this 
coefficient by c{t, /ii, /i2, h, F), we get the equation 

^(^)= c(t,/..L/.,F) {^^-^(^)>^^^(^)-^(^)^^^(^) 

(4.12) 

'■* 92/i(u,t) ^ f^^ dih{t,u 



+F{t){l - F{t)} I f „:.:'^"17 . (in- / ^7r^"'Z.. dn 



.oF(t)-F(n) J.,=tF{u)-F{t) 

The statement of the lemma now follows. D 

The following lemma will be used to show that, with probability tending to one, Fn belongs to 
the allowed class, for all large n, and is a consistent estimate of Fq. 
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Lemma 4.3. Let, under the conditions of Lemma 4-1, F"^^' £ B{Fq,vi) be the solution of 

^{t; h^^'^ , /i^") , /i(") , F) = 0, t G [0, M] , 

where (/) is defined by (4-5), and where h^^' G C[S], h^ G (^[Si] an /ig £ C'[5'2] are non-negative 
functions which have hounded continuous derivatives on the supports S, Si and S2, respectively, 
with finite limits approaching the boundary, respectively. Furthermore, let 



(n) 



h 



Oj 



%^^)'-h' 



Oj 



0, 



j = l,2. 



-7- 0, and 

where, as before, \\ ■ \\ denotes the supremum norm on C[Sj\. Finally, let 

(4.13) ||/i(")- /lolls ^0 and ||9j/i(") - 9j/io||5 ^ 0, j = l,2, 

where \\ • \\s is defined by (4-8) ■ Then F^^' — t- Fq in the supremum metric, as n —)• 00, and F^"' is 
strictly increasing on [0,M] and satisfies F^"''{t) € [0, 1], t G [0,M], for all large n. 

Proof. By Lemma 4.1 we have that F^"'' tends to Fq in the supremum norm on C[0, M], since for 
any r] we can choose a 6 > such that 



|FW-Fo||<r?, 



if \\hp - hojW < 6 and ||/i(") - ho\\ < 6. 

Using Lemma 4.2 we get that, under the conditions of the lemma, that F^"^' is differentiate with 
a bounded derivative /'"•', see (4.12). Specificahy, (4.12) yields: 



/W(t)1^^(F("))'(t) 
= iF(")(t){l-F(")(t)} 



d2h'^''\u,t) _ r^^ dih^''\t,u) 



+ {l_FW(t)}(/i("))'(t)_i7W(t)(4"))'(i)l /c(t,/j("),4"),/i(«),F(")) 



where c(i, /i^" , /i2 > ^ i -^ ) stays away from zero, as n — )• cxd. The corresponding density /o of 
the underlying model similarly has the representation 



foit) = \ Foit){l - Fo{t)} { f 



d2ho{u,t) 



du 



diho{t,u) 



du 



(4.14) 



)^s Fo{t) - Fo{u) Ju:(t,u)esFo{u)-Fo{t) 

+ {1- Foit)}h',,it) - Foit)h',,{t) I / co(t) 



where co{t) is given by: 
By 



\hf-hoj 



co{t) = gi{t){l - Fo{t)} + g2{t)Fo{t). 
1^0, \\h(") - hoW ^ 0, ||fW-Fo| 



and (4.13), we now get: 



sup 

te[o,M] 



c(t, /iS'^\4"\ /.("), FW)-co(i) 



0. 



0, 



MSLE FOR INTERVAL CENSORING 



23 



Again using (4.13), we also get: 



ll/")-/o||^0, 



that is: /'""-' converges to /o in the supremum norm. Since /o stays away from zero on [0,M], this 
means that F^"-) is strictly increasing on [0, M] for all sufficiently large n. 

Furthermore, since (j){t, h" , hi^ , /i^"', i^*-"^) = 0, we get for large n, and t in a right neighborhood 
ofO, 



F(")(t) 



/.(")(t) 



h^r\t) + 4"^(t) + {1 - FW(t)} c^ ,ji:;%%^ du 



>o, 



since, by the convergence of F^^> to Fq, we may assume 1 — F^^>{t) > for t in a neighborhood of 
0, and since h"" and /i'"-* are non-negative. 

Likewise, if Fo{M) = 1, we have, for t in a small left neighborhood of M, 



i7W(t){l_i?W(t)} 



/i(")(f,t) 



dv 



= {4"^(t) + 4"^(t)} F^")(t) - /iS"^(t) + F(")(t){l - F(")(t)} 

= h^^\t)F^''\t) - /iS"^(t){l - F(")(t)}, 
and hence, for t in a small left neighborhood of M, 



M 



/i(")(«,i) 



,=tFi^){t)-Fi^){v) 



dv 



{l_i7H(t)} Ji7W(t) 



/i(")(7;,t) 



=o-F('')(0-^^"H^') 



dt; + /ii")(t)l=4"^(t)F(")(t), 



implying that, for all large n, 1 — F^"'>{t) > for t in a neighborhood of M. This will a fortiori hold 
if Fo(M) < 1. This shows that, for all large n and ah t G [0, M], F('^)(t) G [0, 1]. □ 



We still need to show that the estimates hnj of hoj and /i„ of ho have the properties of hj and 
/i, as defined in Lemma 4.3. 

Lemma 4.4. Let the conditions of Theorem 4-1 be satisfied and let the estimates hnj of and hn 
satisfy Condition 1.1. Then 



nj 



k 



Oj 







anc 



'^nj ^Oj 



Ao^o 



Moreover, 
(4.15) 



\hn — hoWs 



and \\djhn — djhoWs 



0, 



,i = i,2. 



j = l,2. 



Proof. Since we use boundary kernels near the boundary of [0, M], hnj{t) is a consistent estimate 
of hoj{t) for each t G Sj. For if t G [&«, M — 6^] n Sj we just have: 



Ehni{t) = EAuKb„{t-Ti 



^br,{t - u)hoi{u) du = hQi{t) + 0{n 



-2/5x 



where the remainder term is uniform in t G [bn, M — 6„.] H Sj. Since 6„ i 0, we have bn < £ for all 
large n, where e is the "separation parameter" of Condition 1.1, and hence the boundary kernels 
are only relevant for /i„i in a neighborhood of and for hn2 in a neighborhood of M. 
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If t G [0, bn], we have: 



Ehniit) = ait/bn)EAuKb„{t - Ti) + /3{t/bn)E \ An- 

M 



t-T, 



-Kr,_ (t - Ti 



M 



t — U 



Kb„it - u)hoi{u) du 



= a{t/bn) / Kb^{t-u)hoi{u)du + (3{t/bn, . 

Ju=0 Ju=0 On 

rt/bn 

= hoiit) / {a{t/bn)K{u) + ^{t/bn)K{u)} du + 0(n-2/5) 

Ju=-1 

= /ioi(t) + 0(n-2/5), 
again uniformly for t G [0, 6„]. A similar computation can be made for /i„2 if i G [M — bn, M]. Since 



sup 



hnlit) - Ehnl{t)\ = Op (n'^/^y^k^^ 



we now get the uniform convergence in probability of /i„i to /iqi on Si, and similarly we have 
uniform convergence in probability of /i„2 to /102 on 82- 

Next we consider the derivative of hni{t). If t G [bn,M — bn] n 5i we just have: 



E~K,,{t) = j^EAiiKiit-Ti] 



Kb„it - u)hoi{u) du 



= bn / K' {u)hQi[t — bnu) du 

= 6;i j K'{u) {/loi(t) - bnuh'^,{t) + \blu^K^{t) - \blu^hl^{t)] du + o {bl) 
= h',,(t) + 0{bl)=h',,{t) + 0(n~'l'] 
again uniformly in t. Since 



sup 

te[6„,M-6„]n5i 



h'^^it) - Eh'^^{t)\ = Op (n-i/syi^) , 



we only have to consider what happens near the boundary. 

In treating the boundary kernels, we denote for simplicity 6„ by 5. If t G [0, 6], we have 



t+b 



Ehni{t) = a ( H / Kbit - x)hoi{x) dx + (3 



x=0 



t\ f^+U-X 



Kbit — x)hoiix) dx 



x=0 



This can also be written 

Ehniit) 

We write this in the form 

Ehniit) = hoiit) + 



i a ( - j Kiu) + /3 ( ^ ) uKiu) } h^iit - bu) du. 



t/b 



a ( - j Kiu) + /3 ( 7 ) uKiu) W iw — t + bu)hQiiw) dw du, 
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using a second order Taylor development of /iqi with the integral remainder term. Hence 



i^''"^<^'^="''^<^'^^H4l)4l)^^i'i)lHl 



whQi{w) dw 



+ 



1 /'*/''[ ,(t 



1 



a 



> i 



K{u) + /?' 7 uK{u) 



t—bu 



{w — t + bu)hQi{w) dw du 



\a(l]K{u) + (3(l]uK{u) 

ft/b 

/!ii(t) + 0(i>), 6 4-0. 



t—bu 



h^iiw) dw du 



K{u) + /3 ( 7 ) uK{u) \ bu /loi(t) du 



Note that, by Condition 1.2, the functions a, f3, a' and f3' are bounded on [0, 1]. We also have: 



t+b 



oUt 



t+b 



t — X 



f^EK^it) = l{a'[ljl^^^m-.)ho,i.)d.+f^^ vvy.^0 b 



Kb{t — x)hoi{x) dx 



t+b 

Ki,{t — x)hQi{x) dx, 



K'ij{t — x)hoi{x) dx 



Ml 



x=0 



so: 



Since we have: 



Eh'^,{t) = j^Ehni{t) = K,it) + 0{b) = h'o,{t) + O (n-i/^) 
hUt)-EhUt) 



sup 

«e[o,6„ 



Op ( n-i/5 



we now also get that 



sup 

u£Si 



The other cases can be treated in a similar way. 



D 



We now get the following result. 

Lemma 4.5. Let the conditions of Theorem 4-1 be satisfied and let, for small ij > 0, F = Fn £ 
B{FQ,r]) be the solution of the equation 

(l)it;hni,hn2,hn,F) = 0,te [0,M]. 

where (p is defined by (4-5). Moreover, let \\ ■ \\ denote the supremum norm. Then: 

(i) With probability tending to one, Fn is strictly increasing on [0, M], and satisfies Fn{t) £ [0, 1], 
t G [0, M], for all large n. Hence, with probability tending to one, Fn coincides with the MSLE 
for large n and 



I -^n — -^0 1 



0, n — )• oo. 
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(ii) 

||i^n-Fo||=Op(n-3/io),n^oo. 

(in) 

\\p — p \\ — n (ri^'^/^\ 

1 1 -t n -'nil — "-^p I "" ) ) 

where Fn is the solution in F of the linear integral equation 

WU\ T^m^W m/r 3{u, t){F{t) - Fojt) - Fju) + Foju)} 
Fit) - Fo(t) + d,,it) |y^^^ Foit)-F,iu) ^^ 

'■^ git, u){Fiu) - Foiu) - Fit) + Fo(t)} 



du 
,=t Foiu)-Foit) 



^ hnlit){l - Foit)} - hn2it)Foit) 

{l-Foit)}giit) + Foit)g2it) 
(4.16) 



+ d,,it)\[ , k^''i\ , du- [ ^^.^M^du], 
^'^ ' \ Ju<t Foit) - Foi^) Ju>v Foiu) - Foit) [' 



and where dp^ is defined by 

Foit){l-Foit)} 



dPoit) 



giit){l- Foit) + g2it) Foit)- 



Proof. Part (i) is an immediate consequence of Lemma 4.3. 

(ii). We get, again using the approach of the impHcit function theorem 10.2.1 in Banach spaces of 

[1], denoting the derivative w.r.t. ihi,h2,h) by Di and the derivative w.r.t. F by D2: 



\Fn — Fo\ 



[D2(pihoi,ho2, ho, Fq) ^ o Di(j)ihoi,ho2, ho, Fo)] (/i„i - /iqi, hn2 - ^02, K - ho) 



(4.17) +Op [Wihni -hoi,hn2 - ho2,hn - h 



I'D) 

where the norm || • || on the left-hand side and the first norm on the right-hand side denote the 
supremum norm on C[0, M] and the norm in the Op-term denotes the norm 

\\ihi,h2,h)\\ = max{||/ii||, ||/i2||, \\h\\s} , 

where the first two norms denote again the supremum norm and the third norm || • II5 is defined 
by (4.8). 

By well-known results in density estimation, we have, if bn x n~^'^. 



max(||/i„i -hoiW, ||/i„2 -/102II ) = Op (n ^''^\/iog 



;n 



The boundary kernels ensure that the rates are not spoiled by what happens at the boundary. So 
we have to determine the rate of convergence of ||/in — /lolls- We get: 



/ 



u:{u,t)eS 



hniu,t) — hoiu,t) du < M^'"^ < / <hniu,t) — hoiu,t)> du 

[Ju:(u,t)eS *- -" 



>. 1/2 
2 I 
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and 

E 



u:{u,t)eS 



hn{u,t) - EK{u,t)y du = Op(^j=Op (n-^/^^ . 



For the bias we get, if bn < u < 7(u) + bn < t < M — bn, where 7 is the function defining the lower 
boundary of the support 5 of (7 in Condition 1.1, 

Ehn{u, t) - ho{u,t) = EKb„{u - Ti)Kb„{t - Ui)Ai2 

{Foiu') - Fo{t')}Kb„{u - t')Kb„{t - u')g{t', u') dt' du' - ho{u, t) 

{Fo{t - bnv) - Fq{u - bnw)}K{v)K{w)g(u - bnW, t - bnv) dv dw - ho(u, t) = O (6^) . 

If 7(u) < t < j{u) + bn and bn < t < u < M — bn, we get: 

Ehn{u, t) - ho{u, t) = 0(7(n) - t). 
The latter case gives a contribution of order 0(6^) to the integral 

\Ehn{u,t) — hQ{u,t)> du. 



J u 



The other cases can be treated in a similar way, with the conclusion that the dominating contribu- 
tion to the integral 

>. 1/2 

\hn{u,t) - hQ{u,t)\ du\ 

u:{u,t)eS ^ ' J 

is coming from the bias near the lower boundary {(t,7„(t)) : t G [0,M]} of S and is of order 
Op{n~^'^^). So the conclusion is: 

(4.18) \\{hnl - hoi,hn2 - ho2,hn " /lo)|| = Op (^-3/1°) . 

The derivative D2 was computed in the proof of Lemma 4.1 (denoted by 84 there) and the 
derivative Di is given by: 

[[Di^{hoi,ho2,ho,Fo)]iA)]it) 
= Bi{t){l - Fo{t)} - B2{t)Fo{t) 



+ Foit){l-Fo{t)}\ f 

lJv=0 



Bi..t) ,,_['■ Bit.u) 



F„(t) - F„(v) J„, F„(u} - Fo(t) 

where Bi , B2 and B are of the form 

Bi = hi — hoi, B2 = h2 — /io2) B = h — /iq. 

Hence, defining F„ by 
(4.19) Fn-Fo = - [D2(p{hoi, ho2, ho, Foy^ o Di(j){hoi, ho2, ho, Fo)] {Ki- hoi,hn2- ho2,hn- ho), 
we get that F = Fn is the solution of the linear integral equation 

[D2(j){hoi, ho2, ho, Fo)] {F - Fq) = - [Di4>{hoi, ho2, ho, Fo)] (hni - hoi, hn2 - ho2, K - ho). 
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which, letting A = F — Fq and {Bi, B2,B) = {hni — hoi,hn2 — /i02,^n — ^o)) boils down to the 
equation 



{hoi{t) + /io2(t)} A{t) - {1 - 2Fo(t)} I / 

VJv=0 



^°(^'^) dv-t%.PM^.du\A[t) 



^Fomi-Fomii -— r^-.— ^ c^.+ T;:crri^;^r" ^- 



^(^'^) dv-r\,,^M^du}, 



Foit) - Foiv) y„=4 Fo(n) - Fo(t) 

'■^ /io(u,0{A(t)-yl(n)} r^-^ /io(t,u){^(t)-^( n)} 

^/,=o {Fo(t)-Fo(n)}2 ^ + y„^, {F,(u) - Fo{t)} 

= B,{t){l-Fo{t)}-B2{t)Fo{t) 
(4.20) 

+ Fo(t){l-Fo(t)}{/ . 

Ut,=o ^o(i) - -Pb(f^) J«=t Fo{u) - Fo{t) 

We have: 

{ho,{t) + ho2{t)}A{t)-{l-2Fo{t)}{f j,^,f^'^, , dv- r j^l^f^i.^^ dulAit) 

[Jy=oFo{t)-Fo{v) 7„=t Fo(M)-Fo(i) J 

= {5i(t)Fo(t)(t) + g2(t){l - i^o(i)}} ^(i) - {1 - 2Fo{t)}{92{t) - 9iit)}A{t) 

= {{1 - Foit)}gi{t) + Fo(i)52(t)} ^(t). 

Furthermore, 

* ho{u,t){A{t) - A{u)} ^^ ^ r^' ho{t,u){A{t)-A{u)} ^^ 



„=o {Fo(t)-Fo(n)}2 7„=, {Fo(n)-Fo(i)P 

* g{u,t){A{t) - A{u)} ^^ r^' git,u){A{t)-A{u)} ^^ 

„=o Fo(i)-i^o(«) "" Ju=t Fo{u)-Fo{t) 
Finally, 

/lOl(i){l-^o(t)}-/i02(t)Fo(t) 

+ Fo(t){l - Fo(t)} ly^^^ ^„(,)_^^(,) ^- - X^^ ^„(,)_^„(,) 

So we obtain the linear integral equation (4.16) by dividing both sides of (4.20) by {1 — Fo(t)}(7i(t) + 
Fo{t)g2{t)._ 

Hence F^ is the solution of the linear integral equation (4.16), and by (4.17) and (4.18), 

II /? _7?„|| — IIP _7?„||_|_r) (n^^/^^ 

IKn -'Oil — iKn -'OIIT^'-'pl"' 



Note that we have: 

||F„-Fo||=Op(n-3/^o 

by the fact that Di(^(/ioi, /io2i ^Oi -^o) and D2<j){hQi, /io2, /loi -^o)""^ are bounded linear mappings and 
hence: 

||F„ - Foil = Op (\\{hni - hoi,hn2 " /io2, K " /io)||) = Op (n^^/^") . 

(iii). The function Fn satisfies the equation (j){hni, hn2, hn, Fn) = 0, where (j) is defined by (4.9). 
Hence: 

hnl{t){l-Fn{t)}-hn2{t)Fn{t) 

' 'M Jv=0 Fnit) - F^{V) Ju=t Fn{u) - Fn{t) J 
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By (ii) we have ||F„ — Fo|| = Op{n^^'^^), and hence: 
hn{v,t) 



I - 

Jv=o Ffi 



dv 



{t)-Fn{v) 

.=0 i^o(i) - ^oW Jv=o Fo{t)-Fo{u) n /' 



and similarly 



/ 



du 



t Fn{u) - Fn{t) 

-Ut^ f- ait, u){F.iu) - Fo(n) -Fit) + F^ ^^ .3/5) 

Hence we get: 

F.it)-F,it)^d.,it)ir aiuMFnit)-m) Fiu)^Foiu)} ^^ 



«=o Foit) - Foiu) 

g{t, u){Fniu) - Foiu) - Fnit) + Foit)} 



M 

,,, ,. „ ,s ,.„,„j — loyu) — i'n\i^) -T J^'oyi-)} , 
u=t Foiu) -Foit) ""j 

^ hnlit){l - Fojt)} - hn2it)Fo{t) 

{l-Foit)}giit) + Foit)g2it) 

- ^^..'" {/:„ ^1^ - - /: ^^^5^ -} - o. (n-) , 

uniformly for t € [0,M], implying 

Fn = - [D2(i>ihoi, ho2, ho, Fo)'^ o Dicpihoi, ho2, ho, Fo)] (hni - hoi,hn2 - ho2,hn - ho) 

= Fn + Op (n-3/5) . 

D 

We now have in principle solved our problem, since we have shown that Fn is locally asymptoti- 
cally equivalent to the solution F„ of a linear integral equation. Since Fn coincides with the MSLE 
for large n, the MSLE is also locally asymptotically equivalent with Fn- However, to get an explicit 
expression for the bias and variance of the MSLE, we now study a still simpler "toy estimator" , 
which turns out also to be locally asymptotically equivalent to the MSLE. 



Lemma 4.6. Let the toy estimator F = Fn^ be defined as the solution of the equation 

9it,u) 
,„>, Foiu) - Foit) 

hniit){l - Foit)} -K2it)Foit) 



m.,-F„(.„{i...„(„{£^^*^.*. / ^^-^.„ 



{l-Foit)}g,it)+Foit)g2it) 

(4-22) +dF,At)lf ,^#^d.-/ i=.#4tt4- 

^ ' ^"^'\Ju<t Foit) -Foiu) J^^, Foiu) -Foit) [ 
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Then, under the conditions of Theorem 4-1, 

V^n {f^ (.) - Foiv) - M|| A AT (0, a{vf) , 

where bn x n~^'^ and P{v), ai{v) and (t{v) are defined as in Theorem 4-1- 

Remark 4.1. In Lemma 4.6 a toy estimator is introduced, which plays a similar role as the toy 
estimator in the study of the ordinary MLE for interval censoring, introduced in [5] and [11] (the 
term "toy estimator" was coined by Jon Wellner). It is called a toy estimator because we cannot 
use it in an actual sample, since Fq is unknown (and is in fact the object we want to estimate). 
Actually, the solution Fn of the linear integral equation (4.16) in part (iii) of Lemma 4.5 is also 
a toy estimator in this sense (but does not produce explicit expressions for the expectation and 
variance of the asymptotic distribution). 

Proof of Lemma 4.6. We have: 

1 " 
hni{v) = - ^ Kb„{v - Ti)Aii 

i=l 

and hence 

var [hni{v)j = -var {Kb„{v - ri)Aii) 



:i^i^,„(.-T,)^(An-Fo(T,))^^ ^°^^^^^-f^^^^^^(^V i^(.)^^n. 



Likewise 



Furthermore, 



Fo{v){l - Foiv)}g2iv) , ^^^^.^ 



var hn2(v) ~ / K(u) du. 

nb„ 



covar {hni{v), hn2{v)) = lEKbJv - Ti)KbSv - Ui){Au - Fo(Ti)) (A13 - Fo{Ui)) 

= -^EKb^v - Ti)Kb^{v - U,)Fo{Ti)Fo{Ui) ~ .^^JoMM^ fxiufdu = 0, 
n nOn J 

using the "separation condition" g{v,v) = 0. So we obtain: 

^^^ ( {l-Fo{v)}K,{v)-Fo{v)K2{v) \ ^ Foiv){l - Fojv)} ({1 - Foiv)ygiiv) + Foiv)^g2iv)) 
""^ V 9i{v){'i^ - Fo{v)} + Fo{v)g2{v) j^ nhn{gi{v){l - Fo{v)] + FQ{v)g2{v)Y 

Furthermore, 

Jt<v Fo{v) - Fo{t) ^ Jt<^, Fo{v) - Fo[t) 
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and hence 

n 



n-ii?K,„(t; - C/i)2 {Fo(f/i) - Fo(Ti)} {1 - F^{\J{) + Fo^)} { / j^^-f ~5)| dt) 

g{t,v)dt / K{ufdu. 



2 



1 /■ l-Fo(«)+Fo(t) __,^ ^,,_^^ / ^^,^,2 



Likewise 

//■ hniv,w) ^ \ 1 /■ 1 - Foiw) + Fo{v) ^ It^(-,2. 

var / -— - — r ^ , , dii; ~ — — / — —-, — r ^ , , — qiv, w) aw / A u) du. 



Finally, 



covar /i„2(?^), / „ ,\ ' ... dt 
V Jt<v Foiv) - Fo{t) 



-EK,,Sv-U^f{A^s-{l-Fo{U,)}){A,2-Fo{U^) + Fo{T^)) f ^^^-^ dt 
-lEK,Jv-U,f{l-Fo{U,)}{Fo{U^)-Fo{T^)} [ i^f"f "J)!, dt 






and similarly: 

(u ( \ [ hn{v,w) \ Fo{v)gi{v) f ^^ 

covar hni{v), / „ , , ?^^T "^ '" z / K(uY du. 

\ Jw>v Fo[w)-Fo{v) J nbn J 

Combining these facts, we obtain that the variance of the right-hand side of (4.22) is given by: 
Fo{v){l - Fo{v)} ({1 - Foiv)}^gi{v) + Fo(z;) V(«)) 



r, I K{u) du 

nbn{gi{v){l - Fq{v)] + Fo{v)g2{v)Y 

+ ^^^{g,iv) + g2{v)} [Kiufdu 



nbn 

Foiv){l - Foiv)} ({1 - Foiv)}^giiv) + Fo{v)^g2iv)) f j^..2 ^^ 
nbn{gi{v){l - Fq{v)} + FQ{v)g2{v)f J "" "" 

+ "^1/ ^7^4^'^^+/ ^'^rl., dw + g,iu)+g2iu)]fKiufdu 
nbn [Jt<v Fo{v) - Fo{t) 7^>„ Fo[w) - Fo{v) J J 

nbn I "^"^ ' \Jt<, Fo{v) - Fo{t) y^>, FoH-Fo(t>) jjj ^' 
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Hence we get that the asymptotic variance at a fixed interior point v of the solution F of the 
equation (4.22) is given by: 



nbnai 

where ai is defined by (4.2). 

We still have to compute the bias. We have: 



K{uydu, 



{1 - Foiv)}{hi{v) - hijv)} - Fo{v){h2iv) - h2iv)} 
gi{v){l - Fo{v)} + Foiv)g2iv) 

/■ h{t,v)-h{t,v) , / N /■ hn{v,u)-h{v,u) 

+ dFniv) / ^ , , ^ , , at — dFn(v) I — ^ , , ^ , , — du 



{1 - Fo{v)}h'{{v) - FoivMiv) 
2{giiv){l-Fo{v)} + Fo{v)g2{v)y 



b„ u K(u) du 



+ IbldpAv) \ I Jf, ^^'^, , dt- I Jf^ ^^'^ , du \ I u^Klu) du, 
' " °^ ^ \jt<v Fo{v) - Fo{t) y„>, Fo{u) - Fo{v) j J ^ ' ' 

where 

(4.23) hiit)= f Kb^it - u)Fo{u) gi{u) du, Mt) = f Kb„it - u){l - Fo{u)} g2{u) du 
and 

(4.24) hit,u)= f Ki,„it-v)Kb„iu-w){Fo{w)-Fo{v)}g{v,w))dvdw. 

Moreover, 

{1 - Foiv)}hi{v) - Fo{v)h2iv) 

^ '^ ^ '^ \Jt<v Fo{v) - Fo{t) y„>, Fo(n) - Fo{v) 

= Fo{v){l - Foiv)} {gi{v) - g2{v)} - Foiv){l - Foiv)} {gi{v) - g2{v)} = 0. 

This yields the result of the lemma, since we can use the central limit theorem for i.i.d. random 
variables on the right-hand side of (4.22), using 

Kl{t){l - Fo{t)} -hn2{t)Fo{t) 

{l-Fo{t)}g^{t) + Fo{t)g2{t) 

"^ ^ \Ju<t m) - Foi^) Ju>t Fo{u) - Fo{t) j 

-1 V f {1 - Fo{t)}Kb„{t - Ti)Aa - Fo{t)Kb„{t - [/,)A,3 
" ^l {1 - Fo{t)}g,it) + Foit)g2{t) 

+ dF,{t)KbAt - u,)A,2 [ i^;;/^ ~J;l du 

Ju<t ^0[t) - ^0[U) 

(4.25) -dF,{t)KbAt-T,)A,2 [ ^'-^^~^f.^ du 

Ju>t ^0{U) - ^0[t) 
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D 

We now show that the solution of the hnear integral equation (4.16) is equivalent in first order 
to the toy estimator of Lemma 4.6. 

Lemma 4.7. Let, under the conditions of Theorem 4-1, Fn^ solve equation (4-22) of Lemma 
4-6 and let Fn he the solution of the linear integral equation (4-16). Then: 

Fn{t) = Fi°y{t) + 0,(n-^'^ 
Proof. It is clear from the definition of Fn that we only have to show: 



,„<i {Fo(t)-Fo(n)}2 
and 

hn{t,u){Fn{u) - F^(u)] 



du = Op {n'^'"^) 



lu>t {Fo{u) - Fo{t)y 
We only prove the first relation, since the proof of the second relation is completely similar. 

Defining A„ = F„ - Fq, Bni = hni - /iio, -B„2 = hn2 - hio and Bn = hn - ho, we have, for a 
bounded weight function w, 



wit, x) {gi{t){l - Foit)} + g2{t)Fo{t)} ^„(t) dt 



i=0 



^l"Mt,-HMu) Am^ ^^ 



r , . ... , .. \ r hQ{u,t)\AJt)- An{u)\ 

+ w{t,xFo{t{l-Fo{t)}{ °^ y; "^; "^ ^U n 

Jt=o 1 Ju=o {Fo{t} - Fo{u)}^ 

[^' ho{t,u) 

L, {Fo(u)-Fo(t)}2 

wit, x) {B„i(t){l - Foit)} - Bn2it)Foit)} dt 
lt=0 

(4.26) 

+ r wit, x)Foit){i - Foit)} I r f "^"'^\ dv - r ^ff^"^.^, du\ dt. 

Jt=o [Jv=o Foit) - Foiv) J^^t Foiu) - Foit) J 

The right-hand side of this equation is clearly Op(n~^ "), uniformly in x S [0, M], where we use the 
fact that the kernel estimators also behave well at the boundary, since we use boundary kernels. 
Since we can write, for example, 

and 

f^ \T7 f+\rt T? u\\ [^^ hoit,u)Aniu) 
wit,x)Foit){l- Foit)} du 



t=o ^' ^ '^^^^ ^^^V„=t{Fo(n)-Fo(t)}2 

" ' '-' hoiu,t) 



wiu,x)Foiu){l -Foiu)} ' du} Anit)dt 

t=o lJu=o {Foit) -Foiu)}^ 

hoiu,t) 



M ( rx 



+ / wiu,x)Foiu){l- Foiu)} J^' ' ' du\Anit)dt 

Jt=x [Ju=o {Foit) - Foiu)}^ 
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the left-hand side of (4.26) is of the form 

/ w{t,x)An{t)dt, 

Jo 
for a bounded weight function w, depending on w, Fq, h^j and /iq. Defining 

a{t)'t^gi{t){l-Fo{t)}+g2{t)Fo{t) 



+ Fo{t){l - Fo{t)} \ f ,j,.^fX\ U2 ^^ + 
[Ju=o{Fo{t) -Fo{u)}^ 



M 



^°^*'") duUt 



which stays away from zero, for t £ [0, M], we can solve the (Predholm) integral equation 



(4.27) + r\{y,x)Fo{y){l-Fo{y)} ^^'^^ 

for the function 1 1— ;■ u;(t, x), t G [0, x], and next use (4.27) to define w{t, x) for t G (x, M]. 
To show that the solution exists, we write, for t < x, the equation in the form 

|5i(t){l - Foit)} + 92it)Foit) + Fo(t){l - Foit)} j'^^ ^^^^y_^'^^^^^^, dudtj w{t, x) 

_ ho{t,x) 

-{Fo(x)-Fo(t)}2 

+ r y^'^^ {Fo(y){l-Fo(y)}^;(;/,x)-Fo(t){l-Fo(t)}u;(t,x)}dj/ 



dy 



ly=t 

ho{y,t) 



{Fo(t){l - Foit)}wit,x) - Fo{y){l - Fo(y)}u;(y,x)} dy, 



ly=o{Fo{t)-Foiy)V 
and use similar techniques as in the treatment of the integral equation 

[dSihoi,ho2,ho,Fo)]{A) = 0, 

in the proof of Lemma 4.1 to show that the homogeneous equation only has the trivial solution 
zero, which means by the Fredholm theory that there exists a (unique) solution. 
For this choice of the weight function w we find: 

'■* ho{u,t)An{u) 

\,^,{Foit)-Fo{u)}^ "" 
t 

w{u,t){gi{u){l - Fo{u)} + g2{u)Fo{u)} An{u) du 

u=0 

+ w{u,t)Fo{u){l-Fo{u)}< J— -^- , ..^ — -dv 

Ju=o [Jv=o {Fo{u) - Fo{v)y 

f^' ho{u,v){An{v)-An{u)} 
t 

w{u,t){Bnl{u){l-Fo{u)}-Bn2iu)Fo{u)}du 
u=0 

+ / w{u,t)Fo{u){l-FQ{u)}i -—— —--dv- -—— —--dv}du, 
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and the expression on the right-hand side is Op (n^^'^), uniformly for t G [0, M]. 
The term 

'^' hQ{t,u)Aniu) 

, {Fo{u) - Fo{t)y " 
can be treated in a similar way. The conclusion of the lemma now follows. D 

Theorem 4.1 now follows from the asymptotic equivalence of the the toy estimator with the 
solution Fn of the linear integral equation (4.16) and the asymptotic equivalence of Fn with the 
MSLE. 

5. Concluding remarks and open problems. In the preceding it was shown that, under the 
so-called separation hypothesis, the MSLE locally converges to the underlying distribution function 
at rate n~^'^, if we use bandwidths 6„ x n~^'^ in the estimates hnj and hn- The asymptotic (normal) 
distribution was also determined. The results can be used to construct a two-sample likelihood ratio 
test, of the same type as the test, discussed in [7] for the current status model, but this is not done 
in the present paper because of reasons of space. 

If the separation hypothesis does not hold, which means that we can have arbitrarily small 
observation intervals, the asymptotic behavior of the MSLE is still unknown. In this situation 
the local asymptotic limit distribution for the ordinary MLE is also still unknown, although it is 
conjectured that the rate n~^'^, holding under the separation hypothesis, is improved to the rate 
(nlogn)"^'^ in this case. There even exists a conjectured limit distribution in this case, put forward 
in [5] (see also [11]). Supporting evidence for this conjecture is given in a simulation study in [10], 
but a proof is still missing. The latter paper also gives simulation results for the SMLE, and it 
is conjectured that the SMLE will locally have a similar behavior as the MSLE if the separation 
hypothesis hold. The asymptotic behavior of the SMLE again has to be deduced from an associated 
integral equation. This matter is further discussed in [10]. 
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